Search results for "Minimum bounding box"
showing 10 items of 10 documents
Optimized spatio-temporal descriptors for real-time fall detection: comparison of support vector machine and Adaboost-based classification
2013
We propose a supervised approach to detect falls in a home environment using an optimized descriptor adapted to real-time tasks. We introduce a realistic dataset of 222 videos, a new metric allowing evaluation of fall detection performance in a video stream, and an automatically optimized set of spatio-temporal descriptors which fed a supervised classifier. We build the initial spatio-temporal descriptor named STHF using several combinations of transformations of geometrical features (height and width of human body bounding box, the user’s trajectory with her/his orientation, projection histograms, and moments of orders 0, 1, and 2). We study the combinations of usual transformations of the…
On the use of Deep Reinforcement Learning for Visual Tracking: a Survey
2021
This paper aims at highlighting cutting-edge research results in the field of visual tracking by deep reinforcement learning. Deep reinforcement learning (DRL) is an emerging area combining recent progress in deep and reinforcement learning. It is showing interesting results in the computer vision field and, recently, it has been applied to the visual tracking problem yielding to the rapid development of novel tracking strategies. After providing an introduction to reinforcement learning, this paper compares recent visual tracking approaches based on deep reinforcement learning. Analysis of the state-of-the-art suggests that reinforcement learning allows modeling varying parts of the tracki…
A Semantic Web Approach for Geodata Discovery
2013
International audience; Currently, vast amounts of geospatial information are o ffered through OGC's services. However this information has limited formal semantics. The most common method to search for a dataset consists in matching keywords to metadata elements. By adding semantics to available descriptions we could use modern inference and reasoning mechanisms currently available in the SemanticWeb. In this paper we present a novel architecture currently in development in which we use state of the art triplestores as the backend of a CSW service. In our approach, each metadata record is considered an instance of a given class in a domain ontology. Our architecture also adds a spatial dat…
Computer-aided detection of cerebral microbleeds in susceptibility-weighted imaging.
2014
Susceptibility-weighted imaging (SWI) is recognized as the preferred MRI technique for visualizing cerebral vasculature and related pathologies such as cerebral microbleeds (CMBs). Manual identification of CMBs is time-consuming, has limited reliability and reproducibility, and is prone to misinterpretation. In this paper, a novel computer-aided microbleed detection technique based on machine learning is presented: First, spherical-like objects (potential CMB candidates) with their corresponding bounding boxes were detected using a novel multi-scale Laplacian of Gaussian technique. A set of robust 3-dimensional Radon- and Hessian-based shape descriptors within each bounding box were then ex…
Autonomous Mooring towards Autonomous Maritime Navigation and Offshore Operations
2020
Bollard is a vital component of mooring system. It is the anchor point for mooring ropes to be fixed in order to secure the vessel or ship. An algorithm that translates the segmented mask of bollard output from masked R-CNN along with bounding box and associated class probability to its corresponding edge coordinate and finally to the single reference point for efficient detection and classification of bollard towards autonomous mooring is presented. At first stage, Mask R-CNN framework is trained with custom built bollard. The model obtained from the training is inferred with real data resulting in instance segment of bollard. The segmented mask obtained contains relatively large amount of…
EFFICIENT MACHINE LEARNING FRAMEWORK FOR COMPUTER-AIDED DETECTION OF CEREBRAL MICROBLEEDS USING THE RADON TRANSFORM
2014
International audience; Recent developments of susceptibility weighted MR techniques have improved visualization of venous vasculature and underlying pathologies such as cerebral microbleed (CMB). CMBs are small round hypointense lesions on MRI images that are emerging as a potential biomarker for cerebrovascular disease. CMB manual rating has limited reliability, is time-consuming and is prone to errors as small CMBs can be easily missed or mistaken for venous crosssections. This paper presents a computer-aided detection technique that utilizes a novel cascade of random forest classifiers which are trained on robust Radon-based features with an unbalanced sample distribution. The training …
Definition and Performance Evaluation of a Robust SVM Based Fall Detection Solution
2012
We propose an automatic approach to detect falls in home environment. A Support Vector Machine based classifier is fed by a set of selected features extracted from human body silhouette tracking. The classifier is followed by filtering operations taking into account the temporal nature of a video. The features are based on height and width of human body bounding box, the user's trajectory with her/his orientation, Projection Histograms and moments of order 0, 1 and 2. We study several combinations of usual transformations of the features (Fourier Transform, Wavelet transform, first and second derivatives), and we show experimentally that it is possible to achieve high performance using a si…
Alternative method for binary shape alignment of non-symmetrical shapes based on minimal enclosing box
2012
Proposed is a novel method based on the minimal enclosing box (MEB) to determine the canonical orientation associated with a three-dimensional binary shape. It is suggested that, when the shape has no clear distinctive features and two or more of the eigenvalues are similar, this method is more suitable than the commonly used method based on principal component analysis (PCA). An experiment is performed with shapes of human livers by measuring the degree on which a prototypical image (atlas) matches to a new shape after alignment by PCA, minimal area projection (MAP), and MEB showing that in this case MEB outperforms the usual PCA-based alignment method and also the MAP method.
Head Pose Estimation for Sign Language Video
2013
We address the problem of estimating three head pose angles in sign language video using the Pointing04 data set as training data. The proposed model employs facial landmark points and Support Vector Regression learned from the training set to identify yaw and pitch angles independently. A simple geometric approach is used for the roll angle. As a novel development, we propose to use the detected skin tone areas within the face bounding box as additional features for head pose estimation. The accuracy level of the estimators we obtain compares favorably with published results on the same data, but the smaller number of pose angles in our setup may explain some of the observed advantage.
Multiscale Attention-Based Prototypical Network For Few-Shot Semantic Segmentation
2021
International audience; Deep learning-based image understanding techniques require a large number of labeled images for training. Few-shot semantic segmentation, on the contrary, aims at generalizing the segmentation ability of the model to new categories given only a few labeled samples. To tackle this problem, we propose a novel prototypical network (MAPnet) with multiscale feature attention. To fully exploit the representative features of target classes, we firstly extract rich contextual information of labeled support images via a multiscale feature enhancement module. The learned prototypes from support features provide further semantic guidance on the query image. Then we adaptively i…